SYSTEM AND METHOD FOR HANDLING FLOWS IN A NETWORK 


Claim to Priority 

This application claims the benefit of United States Provisional Application No. 
60/149,174, filed on August 17, 1999. 

5 Field of the Invention 

The field of the invention is handling flows in a network, and in particular handling 
packets that relate to the same conversation as a part of a flow. 

Brief Description of the Drawings 
JO Figure 1 shows a switch that handles a flow between two hosts in accordance with an 

; :0 embodiment of the present invention. 

I y Figure 2 shows a flow that passes through one Ethernet switch between two hosts in 

accordance with an embodiment of the present invention. 
W Figure 3 shows flows between two switches and two hosts in accordance with an 

» 1 5 embodiment of the present invention. 

j'S Figure 4 shows multicast flows in accordance with an embodiment of the present 

I'* invention. 

Summary of the Invention 

20 A flow in a network is identified and handled by using a virtual host address. A packet is 

received at a switch with a first virtual host address as its destination address. If the packet is the 
first packet of a flow received by the switch, then a second virtual host address is determined by 
the switch. The first virtual host address is stored in a packet forwarding table correlated with the 
second virtual host address. A subsequently received packet of the same flow has the same first 

25 virtual host address as its destination address, and is forwarded to the second virtual host address 
in accordance with the packet forwarding table. 

Detailed Description 

The wide area network is evolving to one that integrates virtual circuit switching (label 
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swapping) for flows with conventional datagram forwarding. A first step along that road was 
described by Ipsilon by Newman, P et al, in IP Switching - ATM Under IP . IEEE Trans, on 
Networking, Vol .6, No.2, April 1998, which: 

a) uses a classification algorithm to detect flows among the influx of IP packets; 

b) uses IP datagram forwarding to determine where to send the packet; 

c) creates a virtual circuit connection through the switch to the same place that the IP 
packet is being sent; 

d) transmits the VCI of that connection to the upstream switch with an indication 


We have modified this concept to provide flow switching on local area networks (LANs) 
that use Ethernet. Figure 1 illustrates a switch that handles a flow between two hosts, H and K. 
Usually, Ethernet addresses are of hosts rather than endpoints of flows. Our design uses 
Ethernet addresses to also identify flows on the LAN. It is exactly as if the switch contains one 
virtual host for every flow. The Ethernet address of that virtual host, referred to here as V, is 
temporarily assigned from a block of locally administered Ethernet addresses. Packets of a flow 
from host H to host K pass through the virtual host V. The source and destination addresses in 
packets leaving H are H and V respectively. Packets traveling from V to K have source and 
destination addresses equal to V and K. The switch performs Ethernet address swapping as 
follows: 

a) the destination address of an incoming packet is moved into the source address 
field; and 

b) a new destination address is obtained from a "VC forwarding table" held within 
the switch. 

The technique is compatible with existing applications of Ethernet because in effect all 
we have done is to add extra (virtual) hosts to the network. 


e) 


that subsequent packets should be encapsulated with that VCI; and 
arranges that incoming packets encapsulated with that VCI are switched not 


routed. 
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Whereas the Ipsilon technique used a classification algorithm to detect flows among IP 
packets, we have experimented with the idea that the host application should make that decision. 
We have added a single byte, vc_flag, in the general socket structure of our hosts to say that the 
application wants special service for the flow of packets passing through the socket. The - 
presence of that flag tells the socket software to use a virtual host Ethernet address instead of the 
destination Ethernet address implied by the IP header. 

The switch does traditional Ethernet packet forwarding on all packets except those that 
are addressed to a virtual host. Packets addressed to a virtual host are switched using data in a 
VC forwarding table. The first packet for a new flow causes an entry to be made in the VC 
forwarding table based upon the IP destination contained in the packet. 

By this means we have created in the local area a sufficient means to provide quality 
communication service on a per-flow basis. When the technique is matched to flow switching in 
a wide area network the user has full benefit of end-to-end flow switching, from a socket in one 
host to a socket in another. This has been achieved with minimal impact on host software, no 
interference with existing applications, and complete compatibility with existing Ethernets. 

Ethernet RFC 894 packet format 


DESTINATION ADDRESS 
SOURCE ADDRESS 
TYPE 
PAYLOAD 


PAD 
FRAME CHECK 
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The Ethernet frame format illustrated above consists of a destination address, source 
address and protocol type indicator followed by the payload and a frame check. In the following 
diagrams which describe how Ethernet addresses are manipulated during switching, we are only 
" interested in the destination and source addresses. So Ethernet packets will be represented thus: 


DESTINATION 


SOURCE 


Certain Ethernet addresses are used to identify flows. This is done in such a way that 
network software in the host computers connected to the network work under the impression that 
the Ethernet, as always, is a device for sending datagrams (individual packets) from one 
computer to another. An Ethernet switch that supports flows behaves as if it contains within it 
one virtual host for every flow. 


Figure 2 illustrates a flow that passes through one Ethernet switch between hosts H and 
K. The flow is represented in that switch by virtual host V. Host H is connected by an Ethernet 
to port PI of switch S, and P2 is connected by Ethernet to host K. Within the switch, incoming 
packets with destination address V are routed according to the table shown in the lower block. 
Packets arriving with host address H are rejected if they did not come from port PI. Likewise, 
packets from K are rejected if they did not come from port P2. 

The packet forwarding process first copies the destination address (V) of the incoming 
packet into the source address of the outgoing packet and then it copies the new destination 
address from the table. Host K is the destination for packets coming from H, and host H is the 
destination for packets coming from K. 

The same procedure applies when switches are connected in tandem. Figure 3 illustrates 
the case when there are two switches between hosts H and K. 

As is usual with Ethernet switches, the IP addresses and Ethernet addresses of hosts 
attached to a particular port are discovered by scanning packet source addresses or by using ARP. 
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The packet forwarding table used by each virtual host is constructed by examining the header of 
the first IP packet in a flow. 

Of course, virtual hosts do not really exist, even as processes within a switch. It is just - 

that the actions of a switch as seen from outside are exactly as described by the model Internally 
the switch uses a combination of technologies found today in IP routers and virtual circuit 
switches. It is a table-driven process that stores packets in queues, processes their headers and 
transfers them to the appropriate output ports with appropriate attention to the quality of service 
appropriate to each traffic class. 

The same technique can be used for point to multipoint flows, as shown in Figure 4. In 
this example, host H is the root of a multicast tree that transmits packets to the two hosts K and 
L. The forwarding table now has three rows, one for each host in the multicast, and a third 
column indicates which host is the "root" of the multicast tree. Packets coming from H are 
copied to each of the hosts given in the other rows of the table. Packets addressed to V from K 
and L may either be rejected or propagated upstream depending upon the permission stated in 
the"perm" column. Note that if K and L do transmit packets upstream, H must examine the IP 
header to determine the source of each packet. 

An example of a virtual circuit signaling connection set-up protocol follows. A protocol 
for setting up a connection between two hosts A and B takes place in three stages. First A 
requests that the connection be made, then B accepts the request and causes a virtual circuit to be 
created, and finally A confirms that indeed there is a connection. 

The connection request is sent as an ordinary IP datagram from A to B. The accept 
message is sent as a signal, which is a message from A to B that is flagged for special attention in 
each of the network nodes along the way. As this signal progresses through the network a (full 
duplex) virtual circuit is created between A and B. Finally, the confirmation message from A is 
transmitted over the new virtual circuit. 


• ♦ 


A socket number is an identifier chosen by a host to represent one end of a connection. 
Socket numbers for successive conversations should be different one from another so that a long 
time will elapse between repeated use of any one socket number. This allows any messages 
involved in a connection set-up to be retransmitted without ambiguity. For IPv4 the socket 
number is synonymous with port number as used by TCP or UDP. In other words, as is well 
known in art, a port number is associated with a socket number, and this association of a port 
number to a socket can change over time. See, for example, W.R.Stevens "Unix Network 
Programming", Prentice Hall Software Series, April 1990, Chapter 6, "Berkeley Sockets", pages, 
258-304. 

The connection, accept and confirm message are coincident with the IP packets which 
normally start a TCP virtual circuit connection on the Internet. A TCP session begins with the 
following 3-way handshake: Client host A chooses a port number and sends a SYN message to 
server host B. B chooses a port number, and sends a SYN message to A. A can then use the 
connection, and sends an ACK message to B. B then understands that it can also use the 
connection. 

Implementation of the TCP virtual circuit as a switched flow at layer 2 takes place 
concurrently with step two of this handshake. No extra packets need be transmitted. 

The embodiments described above advantageously protect the confidentiality, integrity 
and authenticity of a conversation represented by a flow. As used herein, protecting 
"confidentiality" means preventing unauthorized access to the contents of the flow. Protecting 
"integrity" means preventing the unauthorized manipulation or alteration of the flow. Protecting 
"authenticity" means providing some assurance that the purported source of a packet is the actual 
source of the packet. As shown in Figure 2, the VC fowarding table stores a list of allowed hosts 
(real and virtual) from which packets may come, and to which packets may be sent. Also, switch 
S stores the port number through which switch S communicates with each host. When a packet 
from H arrives at switch S through port P, switch S searches the VC forwarding table for a record 
that correlates the source address of the packet with the port number through which the packet 
has arrived. If such a record is not found in the VC forwarding table, then the packet is rejected. 


In other words, if a packet arrives through the wrong port or from an unknown source, then the 
authenticity and/or integrity of the packet is suspect, and the packet is rejected. This should be 
implemented for both virtual and real host addresses in all of the switches handling a flow to 
maximize security. .... ._ 
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The above description is meant to illustrate, and not limit, the scope of the present 
invention. For example, although Ethernet and Internet protocols were discussed in illustrating 
various embodiments, any suitable protocols can be used in accordance with the present 
invention. Other embodiments of the present invention will be understood and appreciated by 
1 0 those skilled in the art from the present disclosure. 


